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A THEORETICAL FRAMEWORK FOR COMBINING TECHNIQUES THAT PROBE THE LINK BETWEEN 

GALAXIES AND DARK MATTER. 

Alexie Leauthaud 1,2 , Jeremy Tinker 3 , Peter S. Behroozi 4 , Michael T. Busha 4 ", Risa H. Wechsler 4 

ABSTRACT 

We develop a theoretical framework that combines measurements of galaxy-galaxy lensing, galaxy 
clustering, and the galaxy stellar mass function in a self-consistent manner. While considerable effort 
has been invested in exploring each of these probes individually, attempts to combine them are still in 
their infancy. These combinations have potential to elucidate the galaxy-dark matter connection and 
the galaxy formation physics that is responsible for it, as well as to constrain cosmological parameters, 
and to test the nature of gravity. In this paper, we focus on a theoretical model that describes the 
galaxy-dark matter connection based on standard halo occupation distribution techniques. Several 
key modifications enable us to extract additional parameters that determine the stellar-to-halo mass 
relation and to simultaneously fit data from multiple probes while allowing for independent binning 
schemes for each probe. We construct mock catalogs from numerical simulations to investigate the 
effects of sample variance and covariance for each probe. Finally, we analyze how trends in each 
of the three observables impact the derived parameters of the model. In particular, we investigate 
various features of the observed galaxy stellar mass function(low-mass slope, "plateau" , knee, and 
high-mass cut-off) and show how each feature is related to the underlying relationship between stellar 
and halo mass. We demonstrate that the observed "plateau" feature in the stellar mass function at 
M* ~ 2x 10 10 M Q is due to the transition that occurs in the stellar-to-halo mass relation at Mh ~ 10 12 
M Q from a low-mass power-law regime to a sub-exponential function at higher stellar mass. 
Subject headings: cosmology: observations - gravitational lensing - large-scale structure of Universe 



1. INTRODUCTION 

Improved measurements of the link between galaxies 
and the dark matter distribution will benefit a variety 
of cosmological applications but will also provide im- 
portant clues about the role that dark matter plays in 
galaxy formation process. Although multiple techniques 
have been developed for this purpose, no single method 
has yet emerged as the ultimate tool and all suffer from 
various drawbacks. The goal of this paper is to de- 
velop the theoretical foundations required to combine 
multiple probes into a single tool that will provide more 
powerful constraints on the galaxy-dark matter connec- 
tion. This paper extends and complements a growing 
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Becker et al.l 120071: iNorberg et al.l 120081: iMore et all 
20091 120111 ). Galaxy-galaxy lensing (hereafter "g-g 
lensing") utilizes subtle distortions induced in the 
shapes and orientations of distant background galaxies 
in order to measure foreground mass distributions. 
The satellite kinematic method uses satellite galaxies 
as test particles to trace out the dark matter velocity 
field. Neither method can probe the halos of individual 
galaxies. Instead, both techniques must stack an ensem- 
ble of foreground galaxies in order to extract a signal. 
Nonetheless, with the advent of data-sets large enough to 
provide statistically significant samples, improvements 
in photometric redshift techniques, and spectroscopic 
follow-up programs, both methods have emerged as 
powerful probes of the galaxy-dark matter connection 
and have truly evolved into mature techniques over the 
last decade. 

In addition to these two direct probes, there are 
also several popular indirect methods to infer the 
galaxy-dark matter connection from the statistics of 
galaxy clustering. For example, numerous authors 
have employed a statistical model to describe the 
probability distribution P(N\Mh) that a halo of mass 
Mh is host to N galaxies above some threshold in 
luminosity or stellar-mass. This statistical model, 
commonly known as the halo occupation distribu- 
tion (HOD), has been considerably successful at in- 
terpreting the clustering properties of galaxies (e.g., 
' Seliakl 120001: iPeacock fc Smith! 120001: iScoccimarro et all 
20011: IBerlind fc Weinberg! 120021: iBullock et al.l 120021: 



2 



A. Leauthaud et al. 



Zehavi et al. 


|200 


I. 120051: IZheng et al.l 120051 


2007; 


Tinker et al. 


2007; 


Wake et alJl20lH iZehavi et al.l 


201C; 


White et al.l 


2011) 


The HOD provides a description 



of the spatial distribution of galaxies at all scales, 
but it is usually inferred observationally by modeling 
measurements of the two-point correlation function of 
galaxies, £, gg (r). Since they were introduced a decade 
ago, HOD models have progressively increased in fi- 
delity and complexity owing to stronger observational 
constraints but also to the availability of larger, high- 
resolution cosmological N-body simulations of the dark 
matter. For example, analytical descriptions of the 
form and evolution of the halo mass function and the 
large-scale halo bias, both of which are key ingredients 
for HOD models, are approaching percent-level preci- 
sion (e.g., ITinker et afl I2008L 120101 ) . A variety of ex- 
tensions to the basic HOD framework have also been 
proposed. For example, the conditional luminosity func- 
tion §{L\Mh)dL specifies the average number of galax- 
ies of lum inosity L ± dL/2 that reside in a halo of mass 
Mh Ce.g.. lYang et alJ|200l Ivan den Bosch et ai1l2003bl; 



Vale fc Ostrikerll2004t 'Cooravll2006Hvan den Bosch et alj 
2007t iVale fc Ostrikerl |2008| ) and the conditional stellar 
mass function $(Af* \Mh)dM t describes the average num- 
ber of galaxies with stellar masses in th e range ± dM* 
as a function of host halo mass Mh (e.g..lYang et al.ll200l 
iMoster et al.1 120101: IBehroozi et al.l [201011 Furthermore, 
a number of studies are also starting to take into account, 
not only the simple expectation values of the underlying 
relations, but als o the scatter betw e en the observable and 
halo-mass Ce. g.. iMore et al.l 120111 IBehroozi et ail 12011 
IMoster et allboiOD . a crucial ingredient for a complete 
description of the galaxy-dark matter connection. 

Finally, halo mass constraints from the galaxy stellar 
mass function (hereafter "SMF" ) have also been derived 
by assuming that there is there is a monotonic corre- 
spondence between halo mass (or circul ar velocity) and 
galax y stellar mass (or l umino si ty) (e.g.. iKravtsov et all 
2004 IVale fc Ostrikerl 120041: iTasitsiomi et all 1 200 ' 
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referred to as "abundance matching" , is economic in 
terms of data requirements since it only considers the 
observed stellar mass (or luminosity) function. However, 
prior knowledge about the mass distribution of halos 
(and substructure within those halos) from cosmological 
N-body simulations is necessary as well as the assump- 
tion that field halos and subhalos of the same halo mass 
contain galaxies of the same stellar mass. 

While considerable effort has been invested in explor- 
ing each of these probes individually, attempts to com- 
bine them in a fully consistent way are still in their in- 
fancy. Nevertheless, savvy combinations hold great po- 
tential to not only elucidate the evolution of the galaxy- 
dark matter connection, and consequently the galaxy for- 
mation physics responsible for it, but to also constrain 
fundamental physics, including the cosmological model 
and the nature of gravity. For example, measurements 
of small-scale galaxy clustering alone do not yield cos- 
mological constraints unless coupled with probes that 
are sensitive to the mass scales of dark matter halos 
(e.g., cluster m ass-to- light ratios, satellite k i nematics, g-g 
lensing, etc.) ([van den Bosch et al.l l2003al ; ITinker et al.1 



[20051: ISeliak et al.l[2005T). I n particular, [YooeLajJ (|2006h 
and ICacciato et al.l ( 20091) have shown that the combi- 
nation of g-g lensing and galaxy clustering is sensitive 
to Q m and o~%. Conceptually, this sensitivity arises from 
the fact that this particular combination simultaneously 
probes the shape and amplitude of the halo mass func- 
tion at small scales and the overall matter density and 
the bias of the galaxy sample at large scales. 

Other combinations can be sensitive to parameters in 
modified gravity theories. A generic metric theory of 
gravity has two scalar potentials, </>, which affects the 
clustering and dynamics of galaxies, and tp, which affects 
the lensing of light around galaxies. Combining probes 
of g-g lensing with clustering and/or satellite dynamics 
allows a test of the general relativity (GR) prediction 
that ip — <j) as well as the Poisson equations which relate 
these potenti als to the underlyin g density distribution. 
For example, iReves et al.l (j2010| ) have used a combina- 
tion of g-g lensing, galaxy clustering, and redshift space 
distortions to place limits on possible modifications to 
GR on ~10 Mpc scales. 

Tests of gravity on smaller scales, though complicated 
by the fact that structures have undergone non-linear 
evolution, are interesting in several respects. First, all of 
our direct probes of the dark matter are either intrinsi- 
cally limited to small scales (e.g., a few Mpc for satellite 
kinematics) or have significantly larger signals on small 
scales (e.g., below 10 Mpc for both cosmic shear and 
g-g lensing). Second, there are many alternative grav- 
ity theories which predict unique and i nteresting mod- 
ifications on these scales dSmithl [2009t iHui et all 120091 : 
ISchmidtl[20T0l:IJain fc Khourvll2010D . 

In this paper we develop the theoretical framework nec- 
essary to constrain the galaxy-dark matter connection by 
combining measurements of galaxy clustering, g-g lens- 
ing, and the galaxy SMF. The formalism outlined in this 
paper c ould also be appl ied to model satellite kinemat- 
ics (see lMore et alll2"011[) . For this work, we adopt the 
standard HOD framework but with several key modifi- 
cations. For instance, a procedure often adopted in clus- 
tering studies is to fit a set of HOD parameters (typi- 
cally three to five) independently to the clustering signal 
and number density for each galaxy sample. However, 
adopting this strategy would require selecting a common 
binning scheme for all probes. In practice, we would 
like to avoid using a single binning scheme because var- 
ious probes have different signal-to- noise (S/N) require- 
ments. We therefore modify the standard HOD model so 
that we can simultaneously fit data from multiple probes 
while allowing for independent binning schemes for each 
probe. Also, since we are interested in the galaxy-dark 
matter connection, we modify the HOD model so as to 
specifically include a parameterization for the stellar-to- 
halo mass relation (hereafter "SHMR"). In Leauthaud 
et al. 2011b (hereafter Paper II), we demonstrate that 
this model provides an excellent fit to g-g lensing, galaxy 
clustering, and stellar mass function measurements in the 
COSMOS survey from z = 0.2 to z = 1.0. 

For 2 deg 2 surveys such as COSMOS, the finite sam- 
ple size of the observational data set is also an impor- 
tant concern. We will present an estimate of the sample 
variance using mock surveys from numerical simulations. 
We will also estimate the covariance of the data for each 
observational measure. We will demonstrate that this 
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is especially important for modeling the SMF, an effect 
that is usually not incorporated into most analyses. The 
finite sample size in COSMOS biases clustering measure- 
ments through the integral constraint, an effect we will 
also model through our mock surveys. 

The layout of this paper is as follows. To begin with, we 
introduce the parametric form used to model the SHMR 
in $2] Next, in $3) we present the general HOD frame- 
work and our extensions to this model. In we show 
how this model can be used to simultaneously fit g-g 
lensing, galaxy clustering, and SMF measurements. In 
$51 we describe the influence of each model parameter on 
the three obscrvables. We then construct a set of mocks 
catalogs designed to mimic the COSMOS survey and de- 
scribe the behaviour of the covariance matrices for the 
three probes in fJ5J Finally, we draw up our conclusions 
in S3 

We assume a WMAP5 ACDM cosmology with m = 
0.258, Q A = 0.742, Q h h 2 = 0.02 273, n s = 0.963 a H = 
0.796, H Q = 72 km s" 1 Mpc" 1 (jHinshaw et al.ll2009l ). 
Unless stated otherwise, all distances are expressed in 
physical Mpc. The letter Mh denotes halo mass. The 
halo radius is noted Rh- In this paper, halo mass is 
defined as M 20 ob = M(< i?200b) = 200p|7ri?2 00 h where 
R200b is the radius at which the mean interior density 
is equal to 200 times the mean matter density (p). We 
note however that our theoretical framework is valid for 
any reasonable choice of halo definition. Stellar mass is 
noted M*. 

2. THE STELLAR-TO-HALO MASS RELATION FOR 
CENTRAL GALAXIES 

To begin with, we present the mathematical function 
that we use to model the SHMR and we describe the 
influence of each of the five parameters that regulate its 
shape. We will assume that the SHMR is specifically 
valid for "central" galaxies which are located by defi- 
nition at the center of their parent halos. Dark matter 
halos also contain smaller bound density peaks that orbit 
around the center of the potential well. These substruc- 
tures are commonly referred to as sub-halos; these sub- 
halos are the likely sites of "satellite" galaxies that have 
been accreted onto their parent halos. The abundance 
matching technique commonly assumes that satellite 
galaxies follow the same SHMR as centrals provided that 
halo mass is defined at the epoch when satellites were 
accreted onto their par ent halos (M ac ^), rather than the 
current sub-halo mass (iConrov c t aL 20061 iMoster et afl 
120101 : iBehroozi et al.ll2010l) . However, this pre-supposes 
that satellite stellar growth occurs at a similar rate as 
centrals of equivalent halo mass (that is to say with a 
halo mass equal to M acc ). Since one might expect that 
satellites and centrals experience distinct stellar growth 
rates, we model central and satellite galaxies separately 
in order to keep our model as general as possible. 

In § 13.21 we will show how the SHMR can be used to 
predict the central occupation function and then we will 
introduce the model for satellite galaxies in § 13.31 

2.1. Functional form for the SHMR 

Let us consider the conditional stellar mass func- 
tion (the analog of the conditional luminosity function) 
which represents the number of galaxies with M* in the 
range M* ± dM*/2 at fixed halo mass and is noted 



<f>(MJM h ) (e.g.. lYang et all [200l IMoster et all I20T0I: 
IBehroozi et al.ll2010D . The conditional stellar mass func- 
tion can be divided into a central component and a satel- 
lite component: $(M*|M fe ) = ^ c {M it \M h ) + ^ a {M it \M h ). 
&c(M*\Mh) is the conditional stellar mass function for 
central galaxies, and it will be our mathematical rep- 
resentation of the SHMR. Note that in our model, the 
halo mass in the term $ s (M*|M/j) refers to the host halo 
mass. 

In addition to the shape and evolution of the mean 
SHMR, astrophysical processes are expected to induce an 
intrinsic scatter in stellar mass at fixed halo mass which 
it is important to take into consideration when defining a 
functional form for $ c (M H ,|M/ l ). Another non negligible 
source of scatter can be the measurement error associ- 
ated with the determination of stellar masses. In the 
absence of strong observational or theoretical guidance 
for the form and magnitude of the total scatter (intrinsic 
plus measurement), we adopt a stochastic model where 
<I> c (Mh,|M/j) is a log-normal probability distribution func- 
tion (hereafter "PDF" ) with a log- normal scatteiQ noted 
fiogM, • Since we have assumed a log-normal functional 
form, $ c (Af*|Mh) can be written as: 



* c (M,|M h ) 



1 



cxp 



ln(10)CTi og M, v27r 
[log 10 (A/«) - log 10 (/ SHM R(M ft ))] 



2fT 2 

za logM„ 



(1) 



where / SEMR represents the logarithmic mean of the stel- 
lar mass given the halo mass for the $ c distribution func- 
tion. Equation Q] is normalized such that the integral of 
$ c (M*|Mfe) over M* is equal to 1. 

To model $ c we must specify a functional form for 
both /shmr and for <7i og M„ ■ There is increasing evidence 
to suggest that low and high mass galaxies have differ- 
ent stellar-to-halo mass ratios, probably as a result of 
multiple feedback mechanisms that operate at distinct 
mass scales and regulate star formation. We therefore 
require a SHMR that is flexible enough to capture such 
variations. We adop t the functional form presented in 
IBehroozi et al.l ()2010D (hereafter "B10") which has been 
shown to reproduce the local SDSS stellar mass function 
using the abundance matching technique. In practice, 
/shmr(-W/i) is mathematically defined following via its in- 
verse function: 



logio(/ S HMR(^*)) =log 10 (Af h ) 



log 10 (Mi)+£ log 10 



(A) 



1 



( M, 
\M,, 



(2) 



where M\ is a characteristic halo mass, M* t o is a char- 
acteristic stellar mass, /3 is the low mass end slope, 7 
controls the transition region, and 6 controls the mas- 
sive end slope. Details regarding the justification of this 
functional form can be found in § 3.4.3 of B10. 

1 Scatter is quoted as the standard deviation of the logarithm 
base 10 of the stellar mass at fixed halo mass. 
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Note that a variety of similar functional forms have 
been proposed by previous authors. For example, the 
intere sted reader can look at Equation 2 inlMoster et all 
(|2010f) and Equation 20 in I Yang et all (|2009f l. 

In contrast to BIO, we do not parametrize the redshift 
evolution of this functional form. Instead, in Paper II, we 
bin the data into three redshift bins and check for redshift 
evolution in the parameters a posteriori. Another differ- 
ence with respect to BIO is that we assume that Equation 
[2] is only relevant for central galaxies whereas BIO assume 
that the SHMR also applies to satellite galaxies, provided 
that the halo mass of a satellite galaxy is defined as M acc . 

It is important to note that our SHMR traces the 
location of the mean-log stellar mass: /shmr(-^i) = 
(log 10 (M»(M/ l ))). Other authors may report the mean 
stellar mass, (M»(M/j)), or even th e mean halo mass a t 
fixed stellar mass, (Mh(M*)) (e.g., IConrov"eTlII[27)07h . 
These averaging systems will yield different results in the 
presence of scatter. For example, (Mh(M*)) will be bi- 
ased low compared to (log 10 (M*(Mh))) if <j\ 0& m, is non 
zero. This bias will increase with <7i og M» and for larger 
values of the high-mass slope of / S hmh,- 

In Figure Q] we illustrate the impact of the five param- 
eters that determine / SH mr on the shape on the SHMR. 
A brief description is as follows: 

• Mi controls the characteristic halo mass; increas- 
ing Mi will result in larger halos hosting galaxies 
at a given stellar mass. In Figure Q] (which repre- 
sents M* on the x-axis and Mh on the y-axis) : Mi 
controls the y-axis amplitude (halo mass) of / S hmr 
so that a constant change in Mi leads to a con- 
stant up/down logarithmic shift in f^ m - Note that 
this constant logarithmic shift may not be visually 
obvious because the slope of the SHMR increases 
sharply at M» > 10 11 M Q . 

• M* ; o controls the characteristic stellar mass; in- 
creasing M*fi will result in smaller halos hosting 
galaxies at a given stellar mass. In Figure [TJ M*^ 
controls the x-axis amplitude of / s "j,J IB . 

• P controls the low mass power-law slope of / S hmr- 
When (3 increases, the low mass end slope becomes 
steeper. 

• The 5 parameter regulates how rapidly / S gJ, R climbs 
at high M*. Indeed, asymptotes to a sub- 
exponential function at high M* which signifies 
that /shmr climbs more rapidly than a power-law 
function but less rapidly than an exponential func- 
tion (see discussion in BIO). 

• 7 controls the transition regime between the low- 
mass power-law regime and the high-mass sub- 
exponential behaviour. A larger value of 7 cor- 
responds to a more sharp transition between the 
two regimes. 

A quantity that is of particular interest is the mass (we 
refer here to both M* and Mh) at which the ratio M^/M* 
reaches a minimum. This minimum is of noteworthy im- 
portance for galaxy formation models because it marks 
the mass at which the accumulated stellar growth of the 



central galaxy has been the most efficient. In this pa- 
per, and in subsequent papers, we will refer to the stellar 
mass, halo mass, and ratio at which this minimum occurs 
as the "pivot stellar mass", M* lv , the "pivot halo mass", 
M,P iv , and the "pivot ratio", (M h /M*) piy . Note that 

M» lv and M£ 1V are not simply equal to Mi and to M*^. 
Indeed, the mathematical formulation of the SHMR is 
such that the pivot masses depend on all five parame- 
ters. The three parameters that have the strongest effect 
on the pivot masses are Mi, M +> o, and 7. For example, 
as can be seen in the right hand panel of Figure [TJ MJ? IV 
and M plv are inversely proportional to 7. To lesser ex- 
tent, the two remaining parameters, /3 and 5 also have a 
small influence on the pivot masses. 

2.2. Scatter between stellar and halo mass 

We now turn our attention to the second component of 
$ c (M»|Mji) which is the scatter in stellar mass at fixed 
halo mass, o~i og M t ■ The total measured scatter will have 
two components: an intrinsic component (noted crj M> ) 
and a measurement error component due to redshift, 
photometry, and modeling uncertainties in stellar mass 
measurements (noted c™ gM ^ ) . It is reasonable to assume 
that the intrinsic scatter component is independent of 
the measurement error component. Assuming Gaussian 
error distributions, we can write that: 



(ciogM,) 2 - (o-fogM,) 2 + (°SgM.) 



2 



(3) 



While the error distribution for the stellar mass esti- 
mate for any single galaxy may be non Gaussian, in this 
work we are only concerned with stacked ensembles. BIO 
have tested that the error distribution for a stacked en- 
semble is Gaussian to good approximation and that small 
non-Gaussian wings in this distribution are not likely to 
affect this type of analysis. 

In practice, since the data are always binned accord- 
ing to M», the observables are actually sensitive to the 
scatter in halo mass at fixed stellar mass which we note 
o'logMh . It will therefore be useful to understand the link 
between cri og M h and eriogM, ■ If the SHMR is a power law, 
the relationship between <7io g M h and <xi og M,, is simply: 



d(log 10 M h ) 



(4) 



For example, if there is a power law relation between 
halo mass and stellar mass such that Mh = M* then 
f iogM h = T) x ci gM„ • In our case, the SHMR behaves 
like a power- law at low M*. At high M*, however, 
d(log 10 M/ t )/d(log 10 M») increases as a function of M*. 
Therefore if <xi og M„ is constant, 0"i O gM h w iU b e equal to 
ClogMh = P X ci gM, a t l° w but then will continuously 
increase with M* at a rate set by 7 and S. 

If we adopt the best fit model to the SHMR from Pa- 
per II in the redshift range 0.22 < z < 0.48, we find 
that the power-law index of the SHMR increases steeply 
at log 10 (M*) > II so that CTi og M h becomes quite large. 
For example, ciogM h ~ 0.46 dex at log 10 (M») = 11 and 
oiogM h ~ 0.7 dex at log 10 (M*) = 11.5. In practical 
terms, this implies that the most massive galaxies do 
not necessarily live in the most massive halos. For ex- 
ample, a galaxy with ~ 2 x 10 11 M could be the 
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Fig. 1. — Illustration of the influence of Mi, Af»,o, ft, <5 and 7 on the shape of the SHMR. Mi controls the characteristic halo mass. Af * 
controls the characteristic stellar mass. /3 controls the low mass power-law slope. S regulates how rapidly the SHMR climbs at high M*. 7 
controls the transition regime between the low-mass power-law regime and the high-mass sub-exponential behaviour. 



central galaxy of of group with Mh ~ 10 — 10 M , 
or could also be the central galaxy of a cluster with 
Mh > 10 15 M . The increase of ciogM h with M* will 
lead to a noticeable effect in the g-g lensing, clustering, 
and stellar mass functions at large M* that is analogous 
to Eddington bias. This effect will be discussed in further 
detail in § [5j 

3. HOD FRAMEWORK 

In this section, we show how <£> C (M* |M/j) can be used to 
determine the central halo occupation function and we 
introduce five new parameters to describe the satellite 
occupation function. 

3.1. Halo Occupation Functions 

In this paper, we assume that stellar mass is used to 
implement the HOD model since it is expected to be a 
more faithful tracer of halo mass than galaxy luminosity. 

Consider a galaxy sample such that M* > M* 1 (a 
"threshold" sample). The central occupation function, 
noted (N ccn (Alh\Ml 1 )) , is the average number of cen- 
tral galaxies in this sample that are hosted by a halo 
of mass Mh- The satellite occupation function, noted 
(A r sat (M^|Af* 1 )), is the equivalent function for satellite 
galaxies. 

In what follows, we focus on the appropriate equations 
for threshold samples. In Paper II however, we will use 
"binned" samples (M* 1 < M* < M* 2 ) to calculate the 
g-g lensing and the SMF. We therefore note that the 
occupation functions for binned samples are trivially de- 
rived from the occupation function for threshold samples 
via: 



(N cen (Mh \Ml\ Ml 2 )) = (N con (M h IM* 1 ))- (N 



and 



*(M h \Mt')), 
(5) 



(N sat (M h \Mi\Mt 2 )) = (N sat (M h \M^))-(N slit (M h \M^)}. 

(6) 



3.2. Functional form for (N cen ) 
For a threshold sample of galaxies, (iVcenO^I-^* 1 )) is 



fully specified given $ C (M*|M/ 1 ) according to: 



(N cen (M h \M^)} 



<S> c (M*\M h )dM* 



(7) 
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Because the integral of $> c (M*\Mh) over M* is equal 
to 1, (N^MhlMl 1 )) will vary between and 1. 

To begin with, let us make the simplifying assumption 
that (TiogM, is constant. Because $ c is parametrized as a 
log-normal distribution, the central occupation function 
can be analytically derived from Equation [7] by consider- 
ing the cumulative distribution function of the Gaussian: 



(N cen (M h \M^)} 



1 - erf 



log 10 (M*i)-log 10 (/ S HM R (M h )) 



V^ClogM, 

where erf is the error function defined as: 



erf(x) = —= 



e~~* dt. 



(8) 



(9) 



It is important to note that Equation [8] is only valid 
when (TiogM, is constant. In the more general case where 
ciogM, varies, (N ccn ) can nonetheless be calculated by 
numerically integrating Equation [7] In Paper II we will 
consider cases in which ct\ os m* varies due to the effects 
of stellar mass dependant measurement errors. In this 
case, we will numerically integrate Equation [7] to calcu- 
late (N cen ) (see § 4.2 in Paper II). 

We note that most readers may be more familiar with 
a simplified version of Equation [8] that assumes that 
/shme(A4) is a power law. We will now describe the as- 
sumptions made in order to obtain the more commonly 
employed equation for (N ccn ) from Equation [8] 

If we make the assumption that /shmr(A^i) ck and 
we define M min such that M min = f^^Ml 1 ) (in other 
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terms, M m ; n is the inverse of the SHMR relation for the 
stellar mass threshold M' 1 ) then using Equation [8] we 
can write that: 



(N cen (M h \M^)) 



1 - erf 



logio«i„)- logic «) 



\Z2crioi 



M. 



(10) 



If we now use the fact that erf(— x) = — erf(x) and if 
we define eri og M such that cri og M = ciogM./p we can write 
that: 



(N cen (M h \M^)) 



1 + erf 



log 10 (M h ) - log 10 (M min ) 



M 



(11) 



which is a commonly employed formula for (N cen ). 
Firstly, it is important to note that Equation llll is only an 
approximation for (N cen ) for the case when the SHMR is 
a power-law and is certainly not valid over a large range 
of stellar masses. Secondly, <7i og M can be interpreted as 
the scatter in halo mass at fixed stellar mass if and only if 
the SHMR is a power-law and if <Ti og M. is constant. Since 
there is accumulating evidence that the SHMR is not a 
single power law (and the same is in general true for the 
relationship between halo mass and galaxy luminosity), 
we recommend using Equation [8] instead of Equation I 111 

Figures H] and [3] illustrate the difference in (N ccn ) when 
Equation[5]is used to describe clustering instead of Equa- 
tion 111! To make this figure, we have assumed the pa- 
rameter set: log 10 (Mi) = 12.71, log 10 (Af* ) = 11.04, 
P = 0.467, S = 0.62, 7 = 1.89, and eri ogM !=0.25. For 
each stellar mass threshold in Figure [2] we determine 
the values of M m - ln and <7i og M in Equation 1 1 1 1 such that 
the number density of central galaxies and the bias of 
those galaxies is the same as that achieved with Equa- 
tion [8] Thus our procedure mimics what one would ob- 
tain through analysis of the clustering and space density 
of such samples, assuming that the satellite occupation 
would be the same in either analysis. 

Figure [5] reveals that because the SHMR has a sub- 
exponential behaviour at log 10 (M*) > 10.5, (N ccn ) be- 
gins to deviate from a simple erf function for high stel- 
lar mass samples and therefore is not well described by 
Equation [TT] Assuming that Equation [8] correctly rep- 
resents (iV C en), the error made on M m i n can be of order 
10% to 40% at M h = /^(M* 1 ) > 10 12 M if Equation 
[TT1 is used to describe (N cen ) instead of Equation [8] 

We note that this does not invalidate Equation [TT] 
as a possible parameterization of the central occupation 
function. However, interpreting M m j n in Equation II II as 
/shmr^' 1 ) can result in a 10-40% error in the true mean 
halo mass (with larger errors for <7i og M„ > 0.25). Also, 
the "scatter" (<7i og M) constrained by this parametrization 
is not equal to the scatter in a log-normal distribution 
of stellar mass at fixed halo mass. Finally, one trouble- 
some aspect of using the erf functional form in Equation 
fTTI is that (N ccn ) curves for different stellar mass thresh- 
olds may actually cross at low halo mass, implying the 



unphysical condition that halos of mass M/j have a "neg- 
ative" amount of galaxies between two threshold values. 
This is seen at M h ~ 10 115 M Q in Figure [2] For values 
of ci og M t larger than what we have assumed here, this 
effect will occur at even higher halo mass. Using Equa- 
tion [8] with a model for the SHMR prevents this from 
occurring, and (N cen ) for various galaxy samples can be 
calculated self-consistently. 

3.3. Functional form for (N sat ) 
In addition to the five parameters introduced to model 
(iV C en) and cri og M» , we introduce five new parameters to 
model (A sat ). In order to simultaneously fit g-g lensing, 
clustering, and stellar mass function measurements that 
employ different binning schemes, we require a model for 
(N sa ±} that is independent of any given binning scheme. 
For this reason, we parameterize the satellite function 
with threshold samples. The number of satellite galaxies 
in a bin of stellar mass is determined by a simple dif- 
ference of two threshold samples. This also eliminates 
the need to integrate over stellar mass, as required when 
working explicitly through the conditional stellar mass 
function. 

Numerical simulations demo nstrate that t h e oc- 
cupation of sub- halos (e.g., iKravtsov et al.1 120041 : 
IConrov et al.l 120061 ) and satellit e galaxies in cosmolog- 
ical hydrodynamic simulations (1 Zheng et al.l |2005[ ) fol- 
low a power law at high host halo mass, then fall off 
rapidly when the mean occupation becomes significantly 
less than unity. Thus we parametrize the satellite occu- 
pation function as a power of host mass with an expo- 
nential cutoff and scaled to (N ccn ) as follows: 



(N sat (M h \M^)) = 
(N ccn (M h \AO) 



Mh 



exp 



-Mr 



M h 



(12) 



where a sa t represents the power-law slope of the satellite 
mean occupation function, M sa ± defines the amplitude of 
the power-law, and M cut sets the scale of the exponential 
cut-off. Here, Mh refers to the host halo mass of satellite 
galaxies. 

Observational analyses have demonstrated that there 
is a self-similarity in occupation functions such that 
Msat/Mmin ~ constant for luminosity-defined samples 
dZehavi et al.ll2005t IZheng et al 1120071 iTinker et al.ll2007t 
iZheng et al.ll2009l : lZehavi et al.ll2010UAbbas et al.ll2010t l. 
where M m - m is taken from Equation [TT] and is conceptu- 
ally similar to f^uniM* 1 ) (modulo a 10 — 40% difference 
as shown in Figure [3]) and where M* 1 is the stellar mass 
threshold of the sample. Instead of simply modelling 
M sat and M cut as constant factors of fsulm lM^ 1 ), we add 
flexibility to our model by enabling M sat and M cut to 
vary as power law functions of / S hJ ir (-^* 1 ) : 



1Q 12 M G 



and 



M r . 



10 12 M< 







fsHUujM^ 1 ) 

10 12 Af„ 



10 12 M o 



(13) 



(14) 
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Fig. 2. — Impact of the analytic model for the mean number of central galaxies in a given halo . Black lines show the form of (N cen ) for 
stellar mass threshold samples using Equation [8] Orange lines show (N ccn ) when Equation 1111 is use d to describe the clustering instead 
of Equation [8] The parametric form of the SHMR is a power law at low M* and thus Equation 1111 provides a reasonable description of 
(N cen ) at log 10 (Mt) < 10.5. At log 10 (M*) > 10.5 however, (N cen ) deviates from a simple erf function and there are noticeable differences 
between the two proposed forms for (N ceJ1 ). 
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Fig. 3. — Difference between two analytic models for the mean 
number of central galaxies in a given halo. Assuming that Equation 
[8] correctly represents (N cen ), we evaluate the difference between 
the value for Af m ; n if Equation 1111 is used (noted here as M^?^ 1 ) 

to fit clustering data and /shmrCM* 1 ) (noted here as AfjSf). The 
difference is negligible below log 10 (A:f m i n ) < 12 but can be of order 
10 to 40% at higher masses. 



iZheng et all (|2007t) find that M sat /M min ~ 18 for 
SDSS and M sat /M m i n ~ 16 using luminosity defined 
samples in DEEP2. For B cut , the expectation is that 
the cutoff mass scale occurs at M min < M cut < M sat , 
although it can be significantly smaller. 

3.4. Total stellar mass as a function of halo mass 

Using this model, one can also compute the total 
amount of stellar mass in galaxies (summing the con- 
tribution from both centrals and satellites) as a function 
of halo mass Mh- To begin with, let us consider the 
total stellar mass as a function of halo mass in some stel- 
lar mass bin: M* ot (Mh | M* 1 , Ml 2 ) . The expression for 
M* ot (M fc |M*\Mf ) is given by: 



Mt 1 



[$ c (M*\M h ) + $ fl (M,|M h )] MAM,. (15) 



However, in the previous section we have only specified 
the analytic form for $ c (Equation [T]) but not for $ s . 
Indeed, in § 13.31 we outlined an analytic model for (-/V sat ) 
but calculating the analytic derivative of (iV sat ) would be 
tedious. Thankfully, however, we do not specifically need 
to know the functional form of $ s in order to calculate 
Equation [15] By using the integration by parts rule, we 
can re- write Equation [15] in a more convenient form as 
follows: 



Ml ot (M h \Ml\M, 



1.2 \ 



Ml 



M* 1 



M 1 



ft/ft- 

(N cen (M h \M*))dM* - [{N cm (M h \M*))M t ]%> tl 



(N sat (M h \M*))dM* - [(N aa ,t(Mh\M*)) M+ImIi ■ 



Ml ot (M h \Mt\M, 



t2\ 



(16) 

This equation provides us with a convenient way to 
calculate the total stellar mass locked up in galaxies with 
Ml 1 < M* < Ml 2 as a function of halo mass M h . 

3.5. Summary of model parameters 

In total, we have introduced six parameters to model 
the central occupation function (M\, M^o, fi, 5, 7, 
ciogM.) and five parameters to model the satellite oc- 
cupation function (/3 sat , B sat , /3 cut , B cut , a). In addition, 
one could introduce a model for <7io g M, or assume that 
the scatter is constant in which case there would be a 
total of eleven parameters for this model. In Figure 8 
of Paper II, we show the two dimensional marginalized 
distributions for this parameter set using data from the 
COSMOS survey. The model described in this paper 
provides an excellent fit to COSMOS data. Figure 8 in 
Paper II demonstrates that this model is reasonably free 
of parameter degeneracies. A summary and description 



8 



A. Leauthaud et al. 



of these parameters can be found in Table [TJ Figure 2] 
gives an illustration of the central and satellite occupa- 
tion functions for galaxy samples in bins and thresholds 
of stellar mass. 

4. HOW TO DERIVE THE SMF, G-G LENSING, AND 
CLUSTERING FROM THE MODEL 

We now describe how the model outlined in the previ- 
ous section yields analytic descriptions for the g-g lens- 
ing, clustering, and stellar mass function which can then 
be fit simultaneously to observations. 

4.1. Analytical model for the stellar mass function 

The stellar mass function is typically calculated in 
bins in stellar mass. Let us consider the stellar mass 
bin Ml 1 < M* < Af* 2 . The abundance of galaxies 
within this stellar mass bin, ^SMFiM* 1 , A/* 2 ), is simply 
obtained from our model and the halo mass function, 
dn/dMh, according to: 

®SMF{Mt\Mt*) 



4.3. Relationship between AS, the density field, and 
correlation functions 

Consider two different populations characterised re- 
spectively by 5 a and 5b- The two-point cross- correlation 
function of <5 a and 5b at comoving position r co is given 
by: 

£,ab{r co ) = (5a(r co )5 b (x co + f co )). (20) 

For example, if i5 g and <5d m are respectively the over- 
densities of galaxies and dark matter, then we can char- 
acterize their relative distributions via the galaxy-mass 
cross-correlation function which is noted £ gm and is equal 
to: 

£ g m(7co) = (<5 g (rco)(5dm(a;co + »'co)>- (21) 

Similarly, £ gg refers to the galaxy auto- correlation func- 
tion. 

In the following, r co is the three dimensional comov- 
ing distance, rj| jCO is the projected comoving line-of sight 
distance, and r p co is the projected comoving transverse 
distance: 



Ml 2 



$(MJM h )dM* 



ML 



dn 



dM h 



(N tot (M h \Mt\M?))-^ 
o aM h 



dM h 



(17) 



where we recall that <J>(M*|Mfc.) represents the condi- 
tional stellar mass function and (iVtot) is the total occu- 
pation function (including both satellites and centrals). 

4.2. The Lensing Observable, AS 

The shear signal induced by a given foreground mass 
distribution on a background source galaxy will depend 
on the transverse proper distance between the lens and 
the source and on the redshift configuration of the lens- 
source system. A lens with a projected surface mass 
density, S(r), will create a shear that is proportional to 
the surface mass density contrast, AS(r): 

AS(r) = S(< r) - S(r) = S crit x j t (r). (18) 

Here, S(< r) is the mean surface density within proper 
radius r, S(r) is the azimuthally averaged surface den- 
sity a t radius r fe.g.. iMiralda-Esc udc 199il lWilson et all 
120011) . and jt is the tangentially projected shear. The ge- 
ometry of the lens-source system intervenes through the 
critical surface mass density S cr ; t , which depends on 
the angular diameter distances to the lens (Dol), to the 
source (.Dos), and between the lens and source (-Dls) : 



^os 



(19) 



4nG N D OL D LS ' 
where Gn represents Newton's constant. 

2 Note that some authors consider the comoving critical surface 
mass density which has an extra factor of (1 + z)~ 2 with respect 
to ours. 



p,co 



II. 



(22) 



In Paper II, we will employ physical coordinates 
for g-g lensing measurements whereas for clustering 
we will use comov i ng coo rdinates. In previous work, 
iMandelbaum et all (l2006bl) hav e used comoving coordi- 
nates and lJohnston et al.l ( |2007l ) have used physical coor- 
dinates. Therefor e, our g-g lensing formu las more closely 
resemble those of I Johnston et aiT(|2007h . The relation- 
ship between comoving and physical distances is simply 
T co = r ph(l + z). In a similar fashion to Equation I22[ we 
can write that: 



p,ph 



,ph- 



(23) 



In comoving coordinates, the density field is p(r COl z) — 
p(l + £gm(fco, z)) where p = p c .o^m,o is the average den- 
sity of matter in the Universe. Since £ gm is often ex- 
pressed in comoving coordinates, we first derive S in co- 
moving units (noted S co ) and then we transform S into 
physical units (noted S p h) before computing AS. For a 
lens at redshift zl, the projected surface mass density, S, 
is obtained by integrating the 3d density over the line- 
of-sight: 



Sco(^p,CO; ^l) 



r 2 
p,co 



r ',co> Z iJ dr ll,co 



i + 4 



p,co 



||,co> 



(24) 



where r PiCO and ^|| jC o refer respectively to the comoving 
transverse and line-of-sight distance from the lens. In 
principle, this integral should extend from the redshift 
of the observer (zq) to the redshift of the source (zs). 
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TABLE 1 

Parameters in model 



Parameter 


Unit 


Description 


/AT \ — . / A7 \ 

(Ween) Or (-/Vsat) 


Mi 


M 


Characteristic halo mass in the SHMR 


(iV cen ) 




M 


Characteristic stellar mass in the SHMR 


(N cen ) 


P 


none 


Faint end slope in the SHMR 


(iVcen) 


s 


none 


Controls massive end slope in the SHMR 


(Ween) 


7 


none 


Controls the transition regime in the SHMR 


(Ween) 


CT logM» 


dex 


Log-normal scatter in stellar mass at fixed halo mass 


(Ween) 


/Jsat 


none 


Slope of the scaling of M sa t 


(iV aat ) 


Bsat 


none 


Normalization of the scaling of M sa t 


(iVsat) 


/?cut 


none 


Slope of the scaling of M cu t 


(iVsat) 


S cut 


none 


Normalization of the scaling of M cu t 


Wat) 


"sat 


none 


Power-law slope of the satellite occupation function 


(iV aat ) 




Fig. 4. — Illustration of the occupation functions for various galaxy samples as a function of stellar mass. The upper panels represent 
binned galaxy samples whereas the lower panel represent threshold samples. Left panels: central occupation function. Middle panels: 
satellite occupation function. Right panels: total occupation function where (Ntot) = {N ce n) + {N sa t)- The parameters chosen for this 
HOD model correspond to the best fit parameters from Paper II for 0.48 < z < 0.74. 



However, £ gm falls off rapidly enough that in practice, 
the redshift evolution of £ gm can be neglected and in- 
tegrating only out to a distance of r*| | co = 50 Mpc is 
sufficient. Furthermore, for the purpose of computing 
AS, the constant term in Equation [24] can be dropped 
(due to the subtraction in AE) and the mean excess pro- 
jected density E(r) can be approximated by the radial 
integral: 



2p c ,ofl m ,0 J o £gm (y^p,co + r j|, co >^j dr ||,co- (25) 

The mean excess projected density is physical units is 
then S pll = E co x (1 + z) 2 . 



The average £ within radius r is equal to: 

S ph (< r p . ph ) = f P ' Ph S ph (r')r'dr'. (26) 

r p,ph Jo 

Finally, AE is obtained by combining Equations ITSl 1251 
and [261 

4.4. Analytical modeling of t; gg and w(9) 

Our model for calculating the autocorrel ation function 
of gal axies is ba sed on the m odel given in I Tinker et al.l 
(|2005| ) (also see lZhengll2004[ ). As described above, the 
HOD is broken into central and satellite galaxy occu- 
pation functions. Thus, in the HOD context, pairs of 
galaxies come from two distinct terms; pairs within a 
single halo and pairs between galaxies in two different 
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halos. The total correlation function is 



£ 99 (r co ) + 1 = [C(r co ) + 1] + [^(r co ) + l] , (27) 

where lh and 2h refers to "one-halo" and "two-halo" 
terms, respectively. The one-halo correlation function is 
written as 



dMh ^L(m : i))Mj^ F ,{r c 



dM h 



2Rh \2Rh 



(28) 

where n g is the space density of galaxies in the sample 
being modeled, (N(N—1))m is the second moment of the 
distribution of galaxies within halos as a function of halo 
mass, and F'(x) is the radial distribution of pairs within 
the halo normalized to unity. Within a halo, pairs of 
galaxies can be between the central galaxy and a satellite, 
or between two satellite galaxies. The radial pair profile 
is different for these two combinations, thus we express 
their relative contributions to £™(?" co ) as 



(N(N-1)) M 



F'(x) = {N cen N sat ) M F^ s (x) 

| (N sat (N sat -l))M F , 



(29) 

where F' cs is the pair distribution for central-satellite 
pairs and F' ss is the equivalent for satellite-satellite pairs. 
The former is related to the density profile of satellite 
galaxies, and the latter is related to the density profile 
convolved with itself (ana lytic expressions fo r this con- 
volution can be found in ISheth et al.ll200l . Here we 
assume that the radial distribution of satellite galax- 
ies is the same as the dark matter, for which we as- 
sume the profile fo rm of Navarro-Frenk- White (NFW; 
iNavarro et al.lll997f) using th e mass -concentration rela- 
tion of IMunoz-Cuartas et all (|2010f ). Since we assume 
that satellites trace the dark matter, F' cs will be equal 
to the quantity F' c that we will introduce in Equation 1551 
and F' ss will be equal to F' s . 

Central galaxies only exist as one or zero objects in a 
halo, thus they have no second moment. For the second 
moment of satellite galaxies, we assume Poisson statis- 
tics about (iV sa t), which is in goo d agreement with re- 
sults from numer ical simulations ([Kravtsov et al.l [20041 : 
iZheng et al.ll2005D . Possibl e deviations from Poisson be- 
havior (jBusha et al.l 120101: iBovlan-Kolchin et al.l 12010ft 
mainly affect clustering for galaxy samples where a ma- 
jority of the satellites originate in halos with Mh < M sat 
because in this case, (N sat ) drops to (N sat ) < 1. Lumi- 
nous Red Galaxies (LRGs) fall into this category for ex- 
ample. Indeed, satellite galaxies in LRG samples mainly 
originate in Mh < M sat halos because M sat is close to the 
exponential cut-off in the halo mass function (for LRGs, 
M sat ~ 4 x 10 14 M Q ). However, for the types of sam- 
ples that we consider in Paper II, deviations from Pois- 
son statistics should not significantly effect the clustering 
predictions of the HOD. 



A deta iled description of t he two-halo term can be 
found in iTinker et ail (|2005l ). Briefly, in the regime 
where r > Rh of massive halos, this term can be ex- 
pressed as 



&co) = & 2 9 C 2 (r co )£ m (r co ), 



(30) 



where £, m {r C o) is the non-linear matter correlation func- 
tion and b g is the large-scale bias of galaxies in the sam- 
ple, and C( r co) is the scale dependence of dark matter 
halo bias. For ^m (r co ), we use the fitting function of 
ISmith et all (l200l For C (r rn ), we use the fitting func- 
tion of ITinker et al.l (120051 ) . The galaxy bias is computed 
from the HOD by 



n' 1 / b(M h )(N) M 



dMh 



-dM h 



(31) 



wh ere dn/dMh i s the halo mass function, for which we 
use lTinker et al.l (|2008), and b(Mh) i s the halo bias func- 
tion, for which we use [Tinker et al] ([20Tof ). 

In the regime where r < Rh, Equation 1301 breaks down 
due to the effects of halo exclusion; i.e., the effect that 
the center of one halo cannot exist within the virial ra- 
dius of another halo (and still be consid ered a "two- 
halo" pair). This is explained in detail in ITinker et al.l 
(|2005| ) . Because the mass function and bias relation used 
in this analysis are taken from numerical res ults based on 
spher i cal-oy erdensity (SO) halo catalogs (ITinker et al.l 
120081 I2010D . the halo exclusion must be modified to 
match t his halo defin i tion. In the SO halo finding algo- 
rithm of lTinker et al.1 (|2008| ) , halos are allowed to overlap 
so long as the center of one halo is not contained within 
the radius of another halo. Thus, the minimum sepa- 
ration of two halos with radii Ri > R2 is R±, rather 
than t he sum of the two radii, as done in ITinker et al.l 
(|2005l ). For a projected statistic like w(0), this makes 
only a small difference in clustering at the 1-halo 2-halo 
transition, but significantly speeds up computation of the 
2-halo term. 

Once we have calculated £, gg (r co ) for a given HOD 
model, we compute the observable w{9) by: 

w(6) = J dzN 2 (z) ^ J dx^ (V^ 2 + r 2 # 2 ) . (32) 

where N (2) is the normalized redshift distribution of the 
galaxy sample, r co is the comoving radial coordinate at 
redshift z and dr co /dz — (c/i/ )/-\/^m(l + z) 3 + f^. 

Figure [5j shows the breakdown of the angular correla- 
tion function into the one-halo and two-halo terms for 
low-mass and high-mass galaxy samples. 

4.5. Analytical modeling of £ 9m and AS 

We have shown in § 14.31 that the lensing observable, 
AS, can be obtained from £ 9m by performing two inte- 
grals (Equations l25l and 126]) . Following the approach of 
IY00 et all ([20061 ). ^ gm is computed from the HOD and 
AS is obtained by combining Equations [TSl ES] and [26] 
Thus AS is fully specified given our model. Note that 
the calculation of ^ gm is performed in comoving units and 
then the projections of Equations l25landl26lto obtain AS 
are performed in physical units (to match our measured 
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Fig. 5. — The angular correlation function for two different stellar mass thresholds. The filled points are the full HOD calculation for 
w(8). The different curves break the calculation into its distinct parts: The solid curve is the 1-halo term, the dotted curve is the two 
halo term. We further break the 1-halo term into the relative contribution of central-satellite galaxy pairs ( dash-dot curve) and satellite- 
satellite galaxy pairs (long-dash curve). For more massive galaxy samples, the one- halo term is more prominent and it is dominated by 
central-satellite pair counts. 



g-g lensing signal which is computed in physical units in 
Paper II). 

Typically, £ 9m is decomposed into a one-halo and a 
two-halo term 



1 + Wco) = [1 + e g i l n (r co )} + [1 + ^(r co )], (33) 

where the one-halo term represents galaxy-matter pairs 
from single halos and is dominant on small scales (<Mpc) 
and the two-halo term corresponds to pairs from distinct 
halos and is dominant on larger scales (>Mpc). The one- 
halo term is obtained according to: 



1 + £gm ( r co) 



dn M h 1 _ {Ntot)pl ( r co 



dM h p 2R h 



\2R h 



dM h 



(34) 
are 



Further details about the origin of Equation 
presented in the Appendix. 
The product n g F' is split into two terms: 

n g F'{x) = n c F' c {x) + n s F' s {x), (35) 

where F' c is linked to the density profiles of dark matter 
halos (see Appendix) and F' s is related to the convolu- 
tion of the dark matter density profile and the satellite 
galaxy distribution. To calculate F' c we assume spher- 
ical NFW profiles truncate d at Rh and we adopt the 
IMuhoz-Cuartas et al.l (|2010f ) mass-concentration relation 
for a WMAP5 cosmology. To calculate F' s we assume 
that the satellite galaxy distribution follows the dark 
matter distribution. Given this assumption, F' s is simply 
related to the convolution of the NFW profile with itself. 

In this paper, we neglect the contribution to AE from 
sub-halos; lYoo et ail (|2006l) have shown this component 



to be negligible at the 10% level. 

We cal c ulate the two-halo term as described in 
lYoo et ail (|2006f ). with two major exceptions. First, as 
stated in the previous section, we are using the halo ex- 
clusion from the SO h alo definition. Secon d, because the 
halo mass function of [T inker et al. (2008) and halo bias 
function of lTinker et al.l ( 20101 ) are normalized such that 
integrating over all Mh produces the mean matter den- 
sity and a bias of unity, there is no need to employ the 
"break mass" from lYoo et ail (|2006T ) (see their Equation 
16). In the limit where r > Rh of massive halos, 



£gm( r co) — b g (^{r co )^ m (r co ), 



(36) 



analogous to Equation l30l 

In addition to the two terms presented above, we add 
anothe r component to t he modeling of AE which is ab- 
sent in lYoo et~a l. (2006), namely the contribution to AE 
from the baryons of the central galaxy which can be non 
negligible on very small scales (< 50 kp c). Although the 
baryons typically follow Sersic profiles (|SersidlT963[ ). at 
the scales of interest for this study, well above a few ef- 
fective radii (> 20 kpc), the lensing contribution of the 
baryons can be modeled by a simple point-source, scaled 
to (M*), the average stellar mass of the galaxies in the 
sample: 



AE 



stellar 



(r) 



(37) 



In total, the final g-g lensing signal is modelled as the 
sum of three terms: AE to t = AE sto iiar + ASn, + AE 2 h. 
Note that the one halo and two halo terms can also be 
decomposed into central and satellite contributions but 
for simplicity, we have grouped these terms together. 

In order to illustrate the various terms that contribute 
to the g-g lensing signal, we have plotted the signal in 
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Figure [HI 

5. INFLUENCE OF THE MODEL PARAMETERS ON THE 
OBSERVABLES 

In the previous section we have outlined how our model 
can be used to analytically predict the SMF, g-g lensing, 
and clustering signals. We will now investigate how each 
parameter in the model affects the three observables. For 
this exercise, we adopt the best fit model parameters 
for 0.48 < z < 0.74 from Paper II and we vary each 
parameter in turn by 2er around the best fit model. For 
this section, we assume that <Ji og M t is constant. We also 
assume that a sa t is constant and we set a sa t = 1 in this 
section since this is also the assumption that we make in 
Paper II. In total, we therefore study the effects of ten 
parameters: Mi, M* >0 , /3, S, 7, eri og M„, A»t, -Bsat, ficut, 
and B cut . The results are shown in Figures [7] and |8] and 
are described in further detail below. 

5.1. Effect of parameters on the SMF 

The influence of each model parameter on the observed 
SMF is shown in the right hand column in Figure [7] (the 
same column is reproduced in Figure |8]) . The data point 
with an error bar represents the typical error bar for a 
COSMOS-like survey where the error bar includes sam- 
ple variance computed from a series of mock catalogs (de- 
scribed in the following section). The first point worth 
mentioning here is that the errors on the SMF are rela- 
tively small compared to the clustering and the lensing. 
It is always the case that a measurement of a one-point 
statistic from a given set of data is more precise than a 
measurement of a two-point (or higher) statistic. The im- 
plies that the SMF will in general play an important role 
in constraining the parameters of the SHMR. However, 
in follow-up work we will investigate the sensitivity of 
this probe combination to cosmological parameters and 
models of modified gravity for example. In these studies, 
the clustering and the g-g lensing will play a critical role 
despite their typically larger errors bars. 

A second noteworthy point in Figure[7]is that the SMF 
appears to be sensitive to all ten parameters whereas 
certain parameters such as M* an d P have very little 
effect on the clustering and lensing signals. Coupled with 
the fact that the SMF has fairly small error bars, this 
implies that the SMF will have quite a lot of constraining 
power on the overall model compared to the g-g lensing 
for example. 

The effects of M 1 and A/» j0 on the SMF are fairly in- 
tuitive. M\ roughly induces an up/down shift in the am- 
plitude of the SMF and M*^ corresponds to a left/right 
shift in the SMF. A larger value of B sat implies that there 
are fewer satellite galaxies in high mass halos for a given 
stellar mass threshold. Thus we see that an increase in 
B sa t corresponds to a decrease in the amplitude of the 
SMF due to the fact that the contribution from satellite 
galaxies has decreased. Similar arguments apply to B cut . 
From Figure [7] we can anticipate that if the SMF were 
only used to constrain this model, degeneracies would oc- 
cur between B sa t, B cut , and M\. Fortunately, the satel- 
lite parameters have a significant effect on both the clus- 
tering and the g-g lensing so this degeneracy should be 
broken when all three probes are used in conjunction. 

In Figure [9] we highlight the effects of four particular 
parameters on the SMF: j3, 7, S, and erio g M» • The j3 pa- 



rameter affects the low-mass slope of the SMF so that a 
larger value of (3 corresponds to a steeper low-mass slope 
in the SMF. The 7 parameter (we recall that this con- 
trols the transition region of the SHMR as can be seen 
from Figure [TJ has an interesting effect since it regulates 
a "plateau" feature in the SMF at log 10 (M*) ~ 10.5. 
In fact, this feature in the SMF has been noticed al- 
ready and discussed in detail for example in iDrorv et al.l 
(|2009f) . In IDrorv et ail (|2009f ). this feature was described 
as a "dip" because at these scales, the SMF is below the 
best fit Schecter function. However, we note that "dip" 
is a somewhat misleading name for this feature since it 
could also be taken to mean that d-/V/dlog 10 (M*) does 
not decrease monotonically with M*. A close inspection 
of our SMFs in Paper II shows that there is no evidence 
in the data for an actual "dip" in dJV/dlog 10 (M*). In- 
stead, the data are more consistent with a flattening of 
dAT/dlog 10 (M*) around log 10 (M») ~ 10.5. Therefore, 
we would like to suggest that this feature should be de- 
scribed as a "plateau" in the SMF rather than a "dip" . 

In Figure [10] we show the link between the dark mat- 
ter halo mass function, the SHMR, and the SMF. In 
this Figure, we have used the fact that diV/d log 10 M* = 
diV/d log 10 Mh x (d log 10 Mh/d log 10 M*) so that the var- 
ious functions can be linked "by eye" by drawing a box 
between the four different panels. We have illustrated 
how to link the various functions with the dash-dash lines 
in Figure [TU] at the scale of the pivot stellar mass. Fig- 
ure (TU] shows that the "plateau" feature is caused by the 
transition that occurs in the SHMR at M/, ~ 10 12 M 
from a low-mass power-law regime to a sub-exponential 
function at higher stellar mass. 

Finally, the scatter in stellar mass at fixed halo mass 
has a noticeable effect on the SMF at the high mass end 
which is also commonly referred to as Eddington bias. 
A larger value of ciogM, will lead to an inflated observed 
SMF at large stellar masses. 

5.2. Effect of parameters on g-g lensing 

The g-g lensing signal is dominated by the central 
one-halo term roughly on scales below 0.3 Mpc and by 
the satellite one-halo term roughly on scales above 0.3 
Mpc and below a few Mpc (see Figure [6]). Thus, the 
effects of the four parameters that regulate the satel- 
lite occupation function (B sat , B cut , /3 sat , (3 cut ) have a 
strong scale-dependant effect on the g-g lensing signal. 
For example, -B sa t controls the power-law amplitude of 
(N sa t). A smaller value of -B sat will reduce the ratio 
M sat // S ~J ]R (M| 1 ) and will therefore increase the number 
of satellites in a sample. This will lead to an increase 
in the g-g lensing signal from 0.3-1 Mpc due to the in- 
creased amplitude of the one-halo satellite term. Sim- 
ilarly, a smaller value of (3 sa t will also reduce the ratio 
M sa t// S HM R (-M'| 1 ) and by consequence, will increase the 
one-halo satellite term. This effect is more pronounced in 
Figure [5] which illustrates a low-stellar mass sample com- 
pared to Figure [7] which illustrates a high stellar mass 
sample. 

Another parameter worth discussing here is ciogM, • 
Figure [7] demonstrates that eri og M, has a stronger effect 
on the lensing signal for high stellar mass samples com- 
pared to low stellar mass samples. As discussed in § 12.21 
this is simply due to the fact that the data are binned ac- 
cording to M* . The observables are therefore sensitive to 
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Illustration of the various terms that contribute to a g-g lensing signal. The solid black curve shows the total g-g lensing signal 
which can be decomposed into a sum of terms that contribute at various scales. On small scales (~ 10 kpc), the signal is dominated by the 
baryonic content of galaxies represented by the red curve (dot-dot). At intermediate radii (~ 200 kpc), dark matter halos come into play as 
shown by the blue (dash-dash) and magenta (dash-dot) curve. The former represents a NEW profile and the later is due to a contribution 
from satellite galaxies. On large scales (> 3 Mpc), the g-g lensing follows the dark matter linear auto-correlation function scaled by the 
bias factor as depicted by the grey curve (dash-dot-dot-dot). It is interesting to note that the total g-g lensing signal is roughly a power 
law, despite the fact that the various contributing components deviate strongly from power laws. 



the scatter in halo mass at fixed stellar mass, <7iogM h ■ At 
fixed ciogM, , 0io g M h will increase with M*. As a result, 
the effects of scatter are more prominent in g-g lensing 
measurements for high stellar mass samples. 

5.3. Effect of parameters on clustering 

To first order, the stellar mass function and the clus- 
tering of galaxies are tethered; more massive halos are 
both more clustered and less abundant. This is also true 
of galaxies because rare, massive galaxies live in such 
halos. If the amplitude of the stellar mass function in- 
creases, the clustering as a function of stellar mass de- 
creases. This is especially true at masses above the knee 
in the stellar mass function. From Figure [9j increasing S 
or criogM* increases the abundance of high-mass galaxies. 
Given that the number of halos is fixed, this can only 
mean that massive galaxies are occupying less massive, 
less clustered halos. 

There are several parameters that have a direct influ- 
ence on the clustering of galaxies without changing the 
stellar mass function appreciably. The parameters B sa ± 
and /3 sa t are the most important in this regard. They 
control the "shoulder" in the HOD, defined conceptu- 
ally as the increase in halo mass, relative to f^u^Ml 1 ), 
before satellites begin to enter the sample. Quantita- 
tively, this is expressed as the ratio M sat // S H^ R (M* 1 ), as 
shown in Equation ll2l Reducing this ratio increases the 
number of satellite galaxies in a sample, which in turn in- 
creases the large-scale bias of a sample and significantly 
enhances the clustering within the one-halo term. The 
parameters B cut and /3 cu t have a more subtle effect on 
clustering. If the cutoff mass, defined by Equation QJ] is 
below /sIjmrO^* 1 )) then M cut has no effect on clustering. 
But as M cut increases, satellite galaxies are removed from 
low-mass halos. If the density of satellites is held fixed, 
increasing M cut redistributes satellites into more massive 
halos. This will increase the large-scale bias and change 
the shape of the one-halo term such that the correlation 
function deviates from a pure power law form (see the 



Appendix in lZheng et aT1l2009t) . 

6. MOCK CATALOGS, SAMPLE VARIANCE, AND 
COVARIANCE 

In this section we construct mock catalogs in order to 
investigate the effects of sample variance and covariancc 
associated with measurements of g-g lensing, clustering, 
and the SMF. Sample variance occurs due to the finite 
nature of the volume encompassed by any given survey. 
Because of limited volume, any given survey may yield 
a biased measurement of the number density of galaxies 
and halos compared to the full universe. The error bars 
on all three observables must therefore reflect this addi- 
tional source of error. Also, the data-points in all three 
observables will be correlated to some degree. Consider 
the SMF for example. A region of space with high mat- 
ter density will have an increased abundance of galaxies 
of nearly all masses. For w(9), because it is a projection 
of £ gg (r) — which is itself a correlated quantity — multiple 
physical scales will contribute to each bin in 9. For AS, 
wc will show that the data points are correlated on scales 
where satellite galaxies contribute to the lensing signal. 

To investigate both the sample variance and the co- 
variance associated with all three observables, we use 
numerical simulations to construct a series of mock cat- 
alogs for a COSMOS-like survey. Since the volume of 
COSMOS is relatively small, the effects of variance and 
covariance will be quite apparent (whereas the effects 
would decrease if we simulated a larger fiducial survey) 
and so COSMOS is well suited for our purpose. In addi- 
tion, wc will also use these mock catalogs in Paper II to 
analyze the actual COSMOS data. 

COSMOS-like mocks are created from a single simula- 
tion (named "Consuelo") 420 h~ l Mpc on a side, resolved 
with 1400 3 particles, and a particle mass of 1.87xl0 9 
hr 1 Mg0. This simulation can robustly resolve halos 
with masses above ~ 10 11 hr 1 M Q and is part of the 
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Fig. 7. — Effect of varying each of the 10 parameters in turn by la around the best fit model from Paper II for 0.48 < z < 0.74 
where a is the fitted error on the parameter in question. In the left hand column of figures, we show how the predicted w(8) signal 
varies for log 10 (M») > 11.1 (a high stellar mass threshold). In the middle column of figures, we show how the predicted AS signal 
varies for 11.29 < log 1 Q(Af«) < 12 (a high stellar mass bin) and the right hand column shows the predicted variations for SMF down to 
log 10 (M„) = 9.3. To highlight the effects of the parameter variation, in all three cases we have plotted the model minus the fiducial best- fit 
model divided by the fiducial best-fit model. The data point with an error bar represents the typical error bar for each observable for a 
COSMOS-like survey. 
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Fig. 8. — Same as Figurc[7|but the left hand column of figures now shows w(8) for log 10 (Af») > 9.3 (a low stellar mass threshold) and 
the middle column of figures now shows AS for 9.8 < log 10 (M*) < 10.3 (a low stellar mass bin). The right hand column (depicting the 
SMF) is identical to Figure [7] 
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Fig. 9. — Effects of ft, 5, 7, and o"i ogMt on the shape of the observed SMF. Left upper panel: ft determines the low mass slope of the 
SMF. Right upper panel: <5 affects the knee and the high-mass slope of the SMF. Left lower panel: 7 affects the knee of the SM F but 7 also 
affect s the "plateau" feature that has been observed in the SMF at 10 < log 10 (M») < 10.5 (see discussion and references in lDrorv et al.l 
120091) . Right lower panel: (Xi og M« affects the high mass slope of the SMF but also affects the "plateau" feature. A larger value of <7i ogMi> 
leads to an inflated observed SMF at the high mass end. This effect is also commonly referred to as Eddington bias. 



Las Damas suitcQ (McBridc ct al. in prep). We cre- 
ate mocks for three redshift intervals: z\ = [0.22,0.48], 
z 2 = [0.48,0.74], and z 3 = [0.74, 1]. For each redshift in- 
terval, we construct a scries of mocks created from ran- 
dom lines of sight through the simulation volume that 
have the same area as COSMOS and the same comov- 
ing length for the given redshift slice. This yields 405 
independent mocks for the Z\ bin, 172 mocks for the z 2 
bin, and 109 mocks for the Z3 bin. For each redshift bin, 
mocks are created from the simulation output at the me- 
dian redshift of the bin. 

Halos within the simulatio n are identified w ith the 
friends-of-friends halo finder ([Davis et al.l Il985h with a 
linking length of b=0.2. For each redshift interval, halos 
are populated with our best-fit model from Paper II. We 
use the mock-to-mock variance and covariance to esti- 
mate a covariance matrix for the stellar mass function, 
for w(9) (using a series of stellar mass thresholds), and 
for AE (using a series of stellar mass bins). Although the 
data points between the different quantities will be corre- 
lated to some degree (as well as the bins in w(8) and AS) 
we ignore that covariance as we do not have enough sim- 
ulation volume to estimate the w&er-covariance matrix of 
all [N] data points in each redshift bin. 

Figure QT] shows the correlation coefficient matrix for 
the SMF in three redshifts bins for a COSMOS-like sur- 
vey. The first-order effect of sample variance on the SMF 
is to correlate all of the data points so that globally, the 
SMF will shift up and down for different realizations of 
a COSMOS-like survey. 

Figure [12] shows the correlation coefficient matrix for 

4 Details regarding this simulation can be found at 
http : //lss .phy . vanderbilt . edu/lasdamas/simulations .html 



the galaxy clustering for 0.22 < z < 0.48 and for three 
stellar mass thresholds: log^M*) > 9.3, log 10 (M„) > 
10.3, and log 10 (M*) > 11.1. The data are more cor- 
related at larger scales where galaxy pairs come from 
the two-halo term. As shown earlier, clustering at these 
scales is proportional to the matter clustering £ m (r). 
Patches of the universe that exist in an over- or under- 
density tend to have higher or lower clustering in their 
matter. This will be reflected in the clustering of the 
halos and thus the two-halo term for the galaxies. In 
the one-halo term, Poisson fluctuations of the number of 
satellites become more important and the data are less 
correlated at these scales. Overall, as the density of the 
galaxy sample becomes smaller, shot noise will dominate 
on all scales. This can be seen in the progression from 
left to right in the examples in Figure [T2J 

Figure [13] illustrates the effect of sample variance on 
g-g lensing signals for various stellar mass bins and for 
0.22 < z < 0.48. Figure [T4l shows the associated corre- 
lation coefficient matrices. The key point to note here is 
that the sample variance for g-g lensing is dominated by 
the one-halo satellite term on scales of about 100 kpc to 
1 Mpc. The impact of this term becomes more apparent 
in galaxy samples with lower stellar masses as the con- 
tribution from the one-halo central term decreases. The 
fact that the one-halo satellite term has a large sample 
variance compared to the one-halo central term can be 
understood as follows. Consider a sample of galaxies in a 
given stellar mass bin. The galaxies that are satellites in 
this sample will tend to live in more massive halos than 
the galaxies that are centrals (this can be seen in Fig- 
ure [4] for example) . Since more massive halos are more 
rare than less massive halos at fixed survey volume, this 
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Fig. 10. — Illustration of the link between the halo mass function (upper left panel), the SHMR (upper right panel) and the SMF (lower 
right panel). The contribution to the SMF from satellite galaxies has been subtracted so that the SMF in the lower right hand panel only 
depicts the contribution from central galaxies. The dot-dot line shows the SMF when 0"i og M„ = 0. This figure is designed that all three 
quantities can be linked by drawing a "box" between the four panels as shown here by example with the dash-dash lines at the location of 
the pivot stellar mass. We note that this will only work when the SMF with zero scatter (dot-dot line) is considered. This figure shows for 
example that the location of the pivot stellar mass is coincident with the location of the "plateau" feature in the SMF. 
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Fig. 11. — Correlation coefficient matrix for the SMF in three rcdshift bins for a COSMOS-like survey. The effect of sample variance on 
the SMF is to correlate all of the data points so that globally, the SMF will shift up and down for different realizations of a COSMOS-like 
survey. 



explains the large one-halo satellite sample variance. 

Finally, we also use mock catalogs t o estimate the ef- 
fects of the integral constraint (IC) (jGroth fc Peebles! 
19775) on clustering measurements for a small area sur- 
vey. Due to spatial fluctuations in the number density of 
galaxies, the mean correlation function measured from an 
ensemble of samples will be smaller than the correlation 
function measured from a single contiguous sample of the 
same volume as the sum of the ensemble sample. This 
attenuation of w(0) becomes relevant on angular scales 
significant with respect to the sample size. For large sur- 



veys like the SDSS, the IC is not an issue on scales of 
interest. For a pencil-beam survey like COSMOS, how- 
ever, the IC must be taken into account when modeling 
the clustering. We estimate the IC correction to our w(9) 
measurements through the use of the mock galaxy dis- 
tributions described previously. The results are shown 
in Figure fT5l For COSMOS, our fitting functions for the 
IC correction are: 
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Fig. 12. — Correlation coefficient matrix for galaxy clustering for 0.22 < z < 0.48 and for several stellar mass thresholds. 
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Fig. 13. — Illustration of the effects of sample variance on the g-g lensing signals for a COSMOS-like survey. The light grey region 
represent the 2<r variation between mocks and the dark grey region represents the lcr variation between mocks. Sample variance mainly 
affects the one-halo satellite term of the g-g lensing signal. For example, the large variance that can be seen in the bottom right panel 
(8.7 < log 1 g(Af*) < 9.2) from 100 kpc to 1 Mpc is due to the one-halo satellite term. This can be explained by the fact that, for a given 
stellar mass sample, the parent halos of satellite galaxies are more rare at fixed volume than the halos of central galaxies. 



f IC = exp(log lo (0)/3.2) 



4.1 



(39) 



for 0.22 < z < 0.48 and 0.48 < z < 0.74 respectively and 
where 9 is expressed in arc-seconds. For 0.74 < z < 1 
there is sufficient volume such that fjc = 1. We note 
that these fitting functions are only valid for 8 < 10 3 
arcseconds. 

7. SUMMARY AND CONCLUSIONS 

The goal of this paper is to develop the theoreti- 
cal framework necessary to combine measurements of 
galaxy-galaxy lensing, galaxy clustering, and the galaxy 
stellar mass function, into a single and more robust probe 
of the galaxy-dark matter connection. We have achieved 
this goal by introducing several key modifications to the 
standard HOD framework. To begin with, we have mod- 
ified the standard HOD model so as to fit all three probes 



simultaneously and independantly of the selected binning 
scheme. Next, since we are interested in the galaxy- 
dark matter connection, we have also modified the HOD 
model so as to specifically include the stellar-to-halo mass 
relation (SHMR). In a companion paper, Leauthaud et 
al. 2011b, we demonstrate that the model presented here 
provides an excellent fit to galaxy-galaxy lensing, galaxy 
clustering, and stellar mass functions measured in the 
COSMOS survey from z = 0.2 to z = 1.0. 

Nonetheless, while the promise of combined dark mat- 
ter probes in studying galaxy formation, gravity and 
cosmology is clear, we must ensure that our paramet- 
ric description of the SHMR is sophisticated enough to 
capture its possible behavior. There are a number of 
questions that remain to be answered in order to achieve 
this goal. For example, is P(M*\Mh) well described by a 
log-normal distribution and is the scatter in P(M*|M/j) 
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Fig. 14. — Correlation coefficient matrix for g-g lensing for 0.22 < z < 0.48 and for several bins in stellar mass. 



constant or does it vary with halo mass? Do the pa- 
rameters that describe P(M ie \Mh) vary with redshift and 
galaxy type? Can we marginalize over uncertainties re- 
lated to the shapes and concentrations of dark matter 
halos? What exactly do we learn from various probe 
combinations? The challenges are steep but with in- 
creasing large data-sets such as the Dark Energy Sur- 
vey, the Large Synoptic Survey Telescope, the Hyper- 
Suprime Cam survey, and EUCLID, refined and sophis- 
ticated models can be built and constrained by the data. 
Although the model presented in this paper is sophisti- 
cated enough to describe COSMOS data, it is clear that 
further refinements will be necessary given the statisti- 
cal precision of up-coming surveys. Improving models 
such as the one presented in this paper by using insights 
provided, for example, by semi-analytic models of galaxy 
formation and dark matter N-body simulations, is clearly 
a worthy pursuit. 
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APPENDIX 

FURTHER DETAILS ON THE ORIGIN OF EQUATION 34 

We consider it useful to provide some more details on the origin of Equation [M] and in particular, on the link between 
the NFW profile and F' c and F' s . This might be useful for those who are not familiar with the notations of galaxy 
clustering studies. To begin with, consider a central galaxy that is associated with a NFW dark matter halo at redshift 
zl and with halo mass M. The NFW profile, p NFW , is given by: 



PnfwC'*! z l) 
Petit 



(r/R s )(l + r/R s ) 



2 ' 



(Al) 



where 5 is a characteristic (dimensionless) density and R s is the NFW scale radius. The relation between S and the 
NFW concentration parameter c is: 



A c 3 
~ 3 X ln(l + c)-c/(l + c)' 



(A2) 



where A is a chosen over-density (for example, A is often set to 200). The NFW radius is noted Rh and is equal to 
Rh = c x R s . The projected surface mass density of this lens, £, is computed by taking the integral of p N Fw over the 
line-of-sight: 



>0" p ,co, z L \M) = J p NFW (^Jr^ co + ■ 



z L dri 



(A3) 



Analytical expressions for the projection of p NFW to S can be found in [Wright fc Brainerdl ([2000D for example. 
Instead of a single galaxy, now consider an ensemble of central galaxies characterized by the central occupation 
function (N cen ). We note n c such that: 
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Fig. 15. — The mean and dispersion of the angular clustering of galaxies as a function of stellar mass threshold in our mock COSMOS 
simulations. The results shown here are for 0.48 < z < 0.74. The filled circles in each panel show w(9) for a single mock with > lOx the 
area as COSMOS itself. The difference between the large-area mock and the mean of the COSMOS mocks is due to the integral constraint. 
The bottom right panel shows the ratio of w(8) for the large-area mock divided by the mean of the COSMOS mocks. The solid curve is 
a fitting function to account for the IC. For the 0.22 < z < 0.48, the effect of the integral constraint is stronger, while for 0.74 < z < 1.0 
there is sufficient volume such that the IC is unity on all scales measured. 



The probability that a galaxy in this selection lives in a halo of mass M is: 

P(M) = {N cen )±^-.dM. 

Tlr Q.M 



(A5) 



The average surface mass density of the galaxy ensemble is: 



S co (r p , co ,z L )= / / T, co (r PtCO , z L \M)P{M)dMdr {ltCO 

1 dn 



Let us now define F' such that: 



ft F w (y^p.co + rji.co'^) (^cn)^^ydMdr|| iC0 . (A6) 



Combining Equation IA6I and Equation IA7I we obtain: 



dn M 1 „, / \/ r p,co + r j|.co 



Finally, Equation [34] is obtained by considering a galaxy sample that contains both central and satellite galaxies. In 
this case n g is defined as: 
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C A 
n g = I ((JVcen) + (JV«rt)) ^7 dM ' ( A9 ) 

is defined in a similar fashion to Equation IA7I but p NFW is replaced with the convolution of the NFW profile with 
it self. Analytic expre ssions for the convolution of the truncated NFW profile with itself can be found in the Appendix 
of lSheth et all (l200l . 
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